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Summary 

We consider a stochastic process driven by a diffusion and jumps. We devise a technique, which 
is based on a discrete record of observations, for identifying the times when jumps larger than a 
suitably defined threshold occurred. The technique allows also jump size estimation. We prove 
the consistency of a nonparametric estimator of the integrated infinitesimal variance of the 
process continuous part when the jump component with infinite activity is Levy. Central limit 
results are proved in the case where the jump component has finite activity. Some simulations 
illustrate the reliability of the methodology in finite samples. 1 
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1 Introduction 

We consider a stochastic process X starting from xq £ 1R at time t = and such that 

dX t = a t dt + otdWt + dJ t , t G]0, T], (1) 

where a and a are progressively measurable processes, W is a standard Brownian motion and 

J is a pure jump process. A jump process is said to have finite activity (FA) when it makes 

a.s. a finite number of jumps in each finite time interval, otherwise it is said to have infinite 

activity (IA). We provide an estimate of a^dt, denoted by IV, given discrete observations 

{xq, Xt x , ...,Xt n }. The estimator is consistent both when J has FA and when the IA component 

of J is Levy. IV stands for Integral of the infinitesimal Variance of the continuous part of X; 

Jq a^dt is also called integrated volatility in the financial econometric literature. When J has 

1 AMS 2000 subject classifications. Primary: 62G05, 62G20, 62M99; secondary: 7M05 37M10. 
The results of this paper were presented: to the 3rd World Congress of the Bachelier Finance Society, 
Chicago, 21-24 July 2004; to the workshop Quantitative Methods in Finance, Newton Institute, Cambridge, 
31/1/05 - 4/2/05; to the 9th World Congress of the Econometric Society, London, 19th - 24th August 2005, 
http: / /www.econ. ucl.ac.uk / eswc2005/ 
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FA we also give an estimate of jump times and sizes, while, when J has IA we can identify the 
instants when jumps are larger than a given threshold. These results have important applications 
in financial econometrics, see the reviews in Andersen et al. (2005) and Barndorff-Nielsen and 
Shephard (2006). 

The method we propose here extends previous work in Mancini (2001) and Mancini (2004) 
allowing for infinite jump activity and very mild assumptions on a and a. 

Nonparametric estimation of the diffusion coefficient a has been studied, in absence of the 
jump component, e.g. by Barndorff-Nielsen and Shephard (2002). For a review see Fan (2005). 
However, the inclusion of jumps within financial models seems to be more and more necessary 
for practical applications (Das, 2002; Piazzesi, 2005; Bates, 2002). In the literature on non 
parametric inference for stochastic processes driven by diffusions plus jumps, several approaches 
have been proposed to separate the diffusion part and the jump part given discrete observations. 
Berman (1965) defined power variation estimators of the sum of given powers of the jumps. 
Recently these have been recovered and developed in Barndorff-Nielsen and Shephard (2004a), 
Woerner (2006) and Jacod (2006). 

Barndorff-Nielsen and Shephard (2004a, 2004b) define and use the bipower and the multipower 
variation processes to estimate J Q T of dt for given values of p, and in particular they focus on 
p = 2. They assume that a is independent of the leading Brownian motion (in the financial 
literature this is called no leverage assumption) and that the jump process has finite activity. In 
particular they build a test for the presence of jumps in the data generating process. Barndorff- 
Nielsen et al. (2006) and Woerner (2006) show that, in particular cases, the consistency and 
central limit theorem of the multipower variation estimators can be extended in the presence of 
infinite activity jump processes. 

Bandi and Nguyen (2003) and Johannes (2004) assume that at = a(X t ),ot = o-(X t ) and that 
J has FA bounded jumps. They use Nadaraya Watson kernels to obtain pointwise estimators 
of the functions a(x) and a(x) and aggregate information about J. Mancini and Reno (2006) 
combine the kernel and the threshold methods to improve the estimation of the jump part and 
they extend the results to the infinite jump activity framework. 

Our contribution to the extant literature can be summarized as follows. First, in the FA case, 
threshold estimation is a more effective way to identify intervals ]tj-i,tj] where J jumped. 
Second, the threshold estimator of IV is more efficient (in the Cramer-Rao inequality lower 
bound sense) than the multipower variation estimators. Finally, the consistency of the threshold 
estimator holds even under leverage and when the observations are not equally spaced, both in 
the FA and in the IA of jump cases. An alternative extension has been made in Jacod (2006), 
where, in order to obtain a central limit theorem, the diffusion coefficient dynamics has to be 
specified. 

The good performance of our estimator on finite samples of realistic length is shown within three 
different simulated models. 

An outline of the paper is as follows: in section 2 we introduce the framework and the 
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notations; in section 3 we deal with the case where J has FA: we show that by the thresh- 
old method we can asymptotically identify each instant of jump. As a consequence we obtain 
threshold estimators of a^ds and of each stochastic size of the occurred jumps. Using results 
in Barndorff-Nielsen et al. (2005) and in Barndorff-Nielsen and Shephard (2006) we show the 
asymptotic normality of IV, whatever the dynamics for a. Moreover we find the asymptotic 
distribution of the estimation error of the sizes of jump under the no leverage assumption and 
when the jump component is a compound Poisson process. Section 4 is devoted to the case 
when the underlying process contains an infinite activity Levy jump part: in a quite simple way 
we show that the threshold estimator of IV is still consistent, even under leverage and when the 
observations are not equally spaced. Section 5 shows the performance of the estimator of TV in 
finite samples within three different simulated models. Section 6 concludes. 

Acknowledgements. I'm sincerely grateful to Rama Cont, Jean Jacod and Roberto Reno for 
the important comments on this work. I also want to thank PierLuigi Zezza. I thank Sergio 
Vessella and Marcello Galeotti who supported this research by MIUR grant number 2002013279 
and Progetto Strategico. 

2 The framework 

On the filtered probability space [ft, (Ft)te[o,T\i •^ r > P)i ^ W be a standard Brownian motion 
and J be a pure jump process given by J\ + J2, where J\ has FA and J2 has IA and is Levy. 
Let (Xt) tg j 0T j be a real process starting from xq £ IR and such that 

dX t = a t dt + otdWt + dJ t , t e]0, T], (2) 

where a, a are progressively measurable processes which guarantee that © has a unique strong 
solution on [0, T] which is adapted and right continuous with left limits (se e.g. Ikeda and 
Watanabe, 1981; Protter, 1990). Suppose that on the finite and fixed time horizon [0, T] we 
dispose of a discrete record {xo,X tl , X tn _ 1 , X tn } of n + 1 observations of a realization of X, 
with ti = ih, for a given lag h, T = nh. 

When J is a pure jump Levy process, we can always decompose it as the sum of the jumps 
larger than one and the sum of the compensated jumps smaller than one, as follows 

J = Ji + J 2 , 

(3) 

J is = Jo I\ x \>i xn(dt,dx), J 2s = Jq J^^x^dtjdx) - v(dx)dt), 

where \x is the Poisson random measure of the jumps of J, fl(dt, dx) = /J,(dt, dx) — v(dx)dt is 
the compensated measure, and v is the Levy measure of J (see Sato, 1999 or Ikeda and Watan- 
abe, 1981). Ji is a square integrable martingale with infinite activity of jump. For each s, 
Var(J2s) = s Jj a ,| <1 x 2 v(dx) = sa 2 (l) < 00. J\ is a compound Poisson process with finite activ- 
ity of jump, and we can also write J\ s = X)i=i7ii where N is a Poisson process with constant 
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intensity A, jumping at times denoted by (Ti) i=1 N , and each 7$, also denoted 7 Ti , is the size of 
the jump occurred at Tj. The random variables ji are i.i.d. and independent of N. 
More generally a FA jump process is of the form J\ s = Hi where iV is a non explosive 

counting process and the random variables 7$ are not necessarily i.i.d., nor independent of N. 



this first jump within if AjiV > 1; by 7 = minj = i..jv T ■ 

Next section deals with the FA case where J2 = 0, while in section 4 we allow J to have infinite 
activity, where J2 is Levy. 

Further notations. 

For any semimartingale Z, let us denote by AjZ the increment Z t . — Z t ._ 1 and by AZ t the size 

Zt — Zt- of the jump (eventually) occurred at time t. 

[Z] is the quadratic variation process associated to Z. 

[Z^]t is the estimator ^™ =1 (AjZ) 2 of the quadratic variation [Z]t- 

T z denotes the sigma-algebra generated by the process Z. 

H.W is the process given by the stochastic integral J H s dW s . 



By c (low case) we denote generically a constant. 

Plim means " limit in probability" ; dlim means " limit in distribution" . 

If i] is a r.v., MN(0,rj) indicates the mixed Gaussian law having characteristic function (f)(0) = 



3 Finite activity jumps 
3.1 Consistency 

An important variable related to X and containing IVp is the quadratic variation at T 



An estimate of [X] T is given by T l j=i,..., m ( x t j ~ x tj-i) 2 , since Plim^c^i^o J2j=i,..., m ( x t 3 ~ 
x tj-i) 2 = [ x ]ti where tt^ t ' is a finite partition {to = 0,t\,...,t m = T} of [0,T], and |vr( T )| = 
maxj \tj —tj-x\. 

We consider in this section the case in which J has FA, so that (J3J) becomes 



Denote by T" the first instant a jump occurs within if AiN > 1; by 7W the size of 




E[e-h 2 <> 2 }. 




(4) 
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and the quadratic variation gives us only an aggregate information regarding both IV and the 
jump sizes. In order to estimate the contribution of afdt to [X]t, the key point is to exclude 
the time intervals where J jumped. The following theorem provides an instrument to 

asymptotically identifying such intervals. 

Theorem 3.1. Suppose that J = Jj is a finite activity jump process where N is a non 

explosive counting process and the random variables 7j satisfy Vt G [0, T] P{ANt ^ 0, ^N t = 
0} = 0. Suppose also that 

su Pi I ft* a,sds\ 

1) a.s. limsup , — < C{ui) < oo 

2) a.s. L a s ds < oo and limsup < M(u) < oo; 

3) r(h) is a deterministic function of the lag h between the observations, s.t. 

Mogi 

lim r(h) = 0, and lim — — — = 0, 

h-*o h^o r(h) 

then for P '-almost all u 3h(u) s.t. V/i < h{u) we have 

Vi=l,...,n, I{A i N=0}&)=I{(& i X)*<r{h)}(u). (5) 

Assumption 3) indicates how to choose the threshold r{h). The absolute value of the incre- 
ments of any path of the Brownian motion (and thus of a stochastic integral with respect to 



the Brownian motion) tends a.s. to zero as the deterministic function y2/iln^. Therefore, for 
small h, when we find that the squared increment (AjAT) 2 is larger than r(h) > 2/iln^ some 
jumps had to be occurred. 

For the proof we need the following preliminary remarks. 

• The Paul Levy law for the modulus of continuity of Brownian motion's paths (see e.g. 
Karatzas and Shreve, 1999, theorem 9.25) implies that 

a.s. nm sup — j^=^= < 1. 

h^0ie{l,...,n} ,/2/ilogi 



• The stochastic integral a.W is a time changed Brownian motion (Revuz and Yor 2001, 
theorems 1.9 and 1.10): defined the pseudo-inverse of (IVt) t , £t = inf{u : IV V > t}, then 

A i (a.W) = B IVti -B IVtii , (6) 

where B is a Brownian motion. 

• As a consequence, under assumptions 1) and 2) of theorem 13.11 by Karatzas and Shreve 
(1999, theorem 9.25) and the monotonicity of the function xln - it follows that a.s. for small h 

| /** a s ds + J? 1 a s dW s \ 
sup -^=^= < A(u;j, 
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where A(cj) = C(u) + \J M{uo) + 1 is a finite r.v.. 

Proof of the theorem. First we show that a.s., for small h, it holds that Vi, I{AiN=o} — 
I{{AiX) 2 <r(h)}i then we will see that a.s., for small h, it holds also that Vi, I{AiN=o} — ^{(AiX) 2 <r(/i)}i 
and that will conclude our proof. 

1) For each u> set Jq ^ = {i £ {1, n} : AjA r = 0}: to show that a.s., for small h, I{AtN=o} — 
I{(A l x) 2 <r(h)} it i s sufficient to prove that a.s., for small h, supj Q h (AiX) 2 < r{h). To evaluate 
the supj Q h (A-iX) 2 , remark that a.s. 

SUD (AiX>2 - sup ( llL^lMl^l V 2/,logi 2Mogl 



In particular, for small h, sup ig Jq h < 1, as we need. 

2) Now we establish the other inequality. For any uj set Ji t h = {i£ {1, n} : AjiV ^ 0}. In 
order to prove that a.s., for small h, Vi, -T^A^o} — ^{(A;X) 2 <r(/i)} it i s sufficient to show that 



a.s., for small h, inf^^ ^(A^X) > r(/i). In order to evaluate inf^^ h v -W remark that 



(A t X) 2 _ (jtL^ + ^-^^/ilog^ 
1,h ' r(fc) " 2MogI r(/i) + 

the first term tends a.s. to zero uniformly with respect to i. Since for small h we have that 
AjiV < 1 for each i, then the other terms become 



7 r « 



f^asds + Mo-.W) 7r(i) 



The contribution of the first term within brackets tends a.s. to zero uniformly on i. Note that 
the assumption on J guarantees that P{j = 0} = 0, thus a.s. 

lim mf — — = hm — — - > hm — — = +oo. ,-, 

h ieJi,/, r(h) h r(h) h r{h) u 



Remarks. 

i) Assumption 1) simply asks for the sequence supj | f* % i a s ds\/ (hlog j^) 1 ^ keeping bounded 
as h — > 0. It is satisfied if, for example, (a s (u>)) s is bounded pathwise on [0, T]. In particular 
assumption 1) is satisfied in a model with mean reverting drift a s = k8 — kX s . 

If we assume that in equation (J2J) a and a are processes having right continuous paths with 
left limits (cadlag), then assumptions 1) and 2) are immediately satisfied, since a.s. such paths 
are bounded on [0,T]. 
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ii) Note that a FA Levy process satisfies that P{^f = 0} = 0, since z^{0} = (jumps occurring 
with zero size are not jumps). E.g. this is the case for a compound Poisson process with Gaussian 
sizes of jump. 

iii) Frequently, in practice, the lag At, = U — ti-i between the observations of an available 
record {xq, X tl , X tn _ 1 , X tn } is not constant (not equally spaced observations). Theorem 13.11 
and thus also ((7J) below, is still valid. In fact if we set h = maxj At{, all the fundamental 
ingredients of the proof of theorem 13, II hold: 

\AjW\ \AjW\ ^ 
lim sup — ^=^= < nm sup — < 1, 

h-^o , e{ i,..., n} ^ 2h log i ft^o i6{ i,...,„ } log 

by the monotonicity of xln ^. Moreover, using (jfjj), it still holds that 

\Aia.W\ „,, s 
a.s. sup — j^=^= < M(u), 

*e{i,...,n} ^2/ilogi 
since a.s. Vi AJV < AU ■ M(u) < hM(u). 

It is asymptotically equivalent to directly compare each (AiX) 2 with the relative r(Aij): a.s. 
for small h we have, for each i = l...,n, 

/ {(A l X) 2 <r(At l )} = J {A 4 iV=0}- D 

Define 

n 

iv = y^(A^) 2 /|( A , X )2< r ( fe )|. 

i=l 

The consistency of IV is a consequence of theorem 3.1, which is needed in order to asymptotically 
identify and exclude each jump instant. 

Corollary 3.2. Under the assumptions of theorem \S.l\ we have 

n „ T 

Plim^2(A l X) 2 I {(AiX) 2< r{h)} = a 2 dt. (7) 
i=1 ~ Jo 

Proof. Since a.s. for small h we have ^{A;7V=o} = ^{(A,x) 2 <r(h)}i uniformly on i, then 

i i 

PlimJ^(/"' a s ds+ r a s dW s ) 2 -Plim^2( r a s ds + V a s dW s ) 2 I 

■ i J ti — 1 J ti.— 1 „' 1 J- ti 1 " ti — 1 



i=l 



*i— 1 •'ti-1 



which coincides with J* Q a^du, since Yli=l(Jt* j «s^s+ /(.' ^s^Ws^-^AjAT^o} — iVy supj(J* t ' a s ds+ 

Z*^ <x s <w s ) 2 o. □ 



Nonparametric threshold estimation 8 

3.2 Central limit theorems 

As a corollary of theorem 2.2 in Barndorff-Nielsen et al. (2005), from our theorem 13, II we obtain 
a threshold estimator of crfdt, which is alternative to the power variation estimator. An 
estimate of crfdt is needed in order to give the asymptotic law of the approximation error 
^™ =1 (AjX) 2 1{(AiX) 2 <r(h)} ~ Jo a t- We reach a central limit result for IV whatever the dynam- 
ics for a. 



Theorem 3.3 (power variation estimator: theorem 2.2 in Barndorff-Nielsen and al. 
(2005), case r = 4, s = 0). If dX = a s ds + a s dW s , where a is predictable and locally bounded 
and a is cadlag, then for h — ► 

7 ^ 

In the light of this result we now state the following asymptotic properties of the threshold 
estimator of IQ. 

Proposition 3.4. Under the same assumptions as in theorem \'J.l\ and the assumptions of the- 
orem 2.2 in Barndorff-Nielsen et al. (2005) we have that 

1 J2i( A i X ) 4l {(^iX) 2 <r(h)} P f T A 

3 h 'Jo ^ 

Proof. By theorem 13.11 

1 E»(A^) 4 I {(AiX) 2< rW} = 1 Ei( A ^) 4/ {A,jV=0} 

3 h 3 h 

The latter coincides with 



Plim - J -±=± -, (8) 

3 h 



since 

•\n / pt 



1 ! a ^ + Ji x ^Wg 4 / {A ^ 0} 4 (/i In i) 2 

Plim 11 < Plim A 4 7V T ^ = 0. (9) 

3 h 3/i 



Finally (JSJ) coincides with f Q T ofdi, as Barndorff-Nielsen et al. (2005) have shown. □ 



Finally, as a corollary of theorem 1 in Barndorff-Nielsen and Shephard (2006) we have the 
following result of asymptotic normality for our estimator IV t- 
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Theorem 3.5 (Theorem 1 in Barndorff-Nielsen and Shephard (2006)). If dX = a s ds + 
a s dW s , where a and a are cadlag processes then, as h — > 



Vh Jo 



where B is a Brownian motion independent of X (recall from the notations that [X^\t = 

Proposition 3.6. Under the assumptions of theorem \3.1\ and if a is cadlag and locally bounded, 
a is cadlag and J- x -measurable, then we have 



YJi=i( ^i x ) 2l {{AiXy<r{h)} ~ Jo CTtdt 

J2i(^i X ) 4l {(AiX) 2 <r(h)} 



AT (0,1). 



Proof. Denoting by Xq the continuous process given by X$ t = J * a s ds + f^a s dW s , for all 
t E [0, T], similarly as in @ 

Sr=i( A ^) 2/ {(A l x)2<r(/ l )} - J T a tdt 



coincides with 



dlim 



dlim 



3 Ei( A ^) 4/ {(A l X) 2 <r(h)} 

Er=i(A,X ) 2 - / T afdt 



|Ei(A<^o) 4 -|Ei(A<Xo)4lA 4 ^ 



The first factor tends in law to y/2 J Q T a\dB u , by Barndorff-Nielsen and Shephard result (2006, 
theorem 1). However (Jacod and Protter, 1998) B is independent on the whole X. Now the 
assumption a £ T ensures that a is independent of B and thus, conditionally on a, B is again 
a Brownian motion and J Q T a\dB u is Gaussian with law AA(0, 2 J Q T cr^du) . Thus, conditionally 
on a, we have that o Jrl oJt — l _^ \\ However the convergence in distribution 
holds even without conditioning. □ 

Remarks. 

i) A comparison with the bipower variation (BPV) estimator shows that the advantages of 
the non parametric threshold method are al least two. 

The threshold estimator of IV is efficient (in the Cramer- Rao inequality lower bound sense), 
in fact we showed that iv t-iVt^ tends in distribution to AA(0, 2), while Barndorff-Nielsen and 

vfty IQ T 

Shephard (2004b, p. 29) show that (under the further assumption that A is a diffusion) the limit 

_ 2 

law is AA(0, + 7r — 3). In particular the threshold estimator is efficient (see Ait-Sahalia, 2004 
for constant a). 
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Moreover, since we asymptotically identify each jump instant, we can apply known estimation 
methods for diffusion processes also to jump-diffusion processes as soon as we have eliminated 
the jumps (see e.g. Mancini and Reno, 2006). 

ii) In Mancini and Reno (2006) we show that it is possible to consider also a time varying 
threshold, which is particularly important for the practical application of the estimator. □ 

By theorem 13. II and by the fact that, for small h, the probability of more than one jump over 
an interval ]ti_i, ij] is low, it is clear that an estimator of each jump instant is obtained through 

AjiV = I{(AiX) 2 >r(h)}- 

Moreover a natural estimate of each realized jump size is given by 

7W = A i X/{( A . x) 2 >r ( / j)}, 

since when a jump occurs then the contribution of f,* 1 a u du + f/* <r u dW u to AjX is asymp- 
totically negligible. In Mancini (2004) we have shown the consistency of each 7W when T — > 00. 
However we only gave a lower bound for the speed of convergence when the coefficients a and 
a are stochastic processes. Here we show that, at least under the no leverage assumption and 
when J\ is Levy, the speed is exactly *Jn. 



supi I J t a s ds\ 

Theorem 3.7. If J is a compound Poisson process, if a.s. limsup fjr < C(ui) < 00 

for some fi > 0.5 (which is the case if a is cddldg), if a is an adapted stochastic process with 
continuous paths, if a is independent of W and N , with E[J^ a 2 ds] < 00, if the threshold r(h) 
is chosen as in theorem lff.il then 

V^( 7 W -7 (i) W>i}) ± MAT (o,T £ a 2 s dN s ^j . 

Proof. 

V^^(7»-7 (i) I { A i iV>i } ) = 

i 

Vn^2^i X I{(AiX) 2 >r(h), AiN=0} + V™ ( A *^{(A l X) 2 >r(/i), A;7V=1} ~ 7^{A;iV=l}) 

i i 

+Vn^2 ^AjX/{( AiX ) 2>r ( fe)i a x n>2} ~ 7 (i) ^{AiAf>2}) : 

i 

by theorem 13.11 a.s. for small h, the first term vanishes. The third term tends to zero in 
probability, since 

i 

P(Ui{A,iV > 2}) < nO(h 2 ) = 0(h). 
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Therefore we only have to compute the 

dlinV^^ (A l X/ {(AiX)2>r(h)iAiAr=1} - 7W I {Ai n=i} 



dhW^ (a*X - 7 «) I {A , 



JV=1} 



dlim-y/n^ f / a s ds + / cr s dW s J /{ Ai Ar=i}- 



(10) 



However, since for small h 



ti— 1 



a s <is 



^{A l Af=i} > e ) < 



P{^C{uj)h^N T > e} < P {/i^ a5 VrC(a;)iV T > e} -» 0, 
as /i — ► 0, (|T0|) coincides with 



dlim / <7s<ZW s i{A j iV=i} • 



Let us compute the characteristic function 



f ' a s dW s 



conditionally on a. 



f** , a s dW s 



Vh 



are independent Gaussian random variables with law J\f ( 



Jt i-1 s 



Since and iV are independent (Ikeda and Watanabe, 1981), our characteristic function equals 



n? =1 £ 



i.eVT 



f t 4 , asdWs 

1 77 I{A X N=1} + ^{AiAT^l} 



1 Q 2rp- r »-l 



1 ) e - Xh Xh ) =n« =1 (i + M- 



(11) 



However maxj \ 9 n i\ — > 0, Y17=i \® ni \ — ^ an( l 

n 



i=l 



" - - I I h — A / i r~T^ - 1 1 J* 



!Z^2 



where, for each i, & are suitable points belonging to Therefore (Chung, 1974, p. 199) (|11|) 



A/ T e—a-'.-i <fo 



e 2 



tends to e JU V J ; which coincides with (Cont and Tankov, p. 78) £7 

the characteristic function of a mixed Gaussian r.v. r]Z where Z(P) = M(0, 1) and r/ 2 



□ 



4 Infinite activity jumps 
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Let us now consider the case when J has possibly infinite activity. Denote 

(12) 



X 0s = J S a t dt + Jg a t dW t , 



Xi = Xo + Ji, 
and note that since JLi^ x 2 v{dx) < +oo, then as e — > 

cr 2 (er) = / ^(cte) -» 0. 

J|z|<e 

In fact our threshold estimator is still able to extract IV from the observed data. The reason is 
that now 

r-T f-T r f-T 



[ a 2 u du+ [ [ x 2 n(dx,du) = [ o\du + V(AJi s ) 2 + V(AJ 2s 

Jo JO J\x\>0 JO <T <r 



and the threshold r{h) cuts off all the jumps of J\ and the jumps of J2 larger, in absolute value, 
than 2-sJ r(h). However such jumps are all jumps of J2 when r(h) — > 0. 

Theorem 4.1. Let the assumptions 1) (pathwise boundedness condition on a), 2) (pathwise 
boundedness condition on a) and 3) (choice of the function r{h)) of theorem \3.1\ hold. Let 
J = Ji + J2 be such that J\ has FA with P{AiN 7^ 0} = 0{h) for all h, for all i = l..n; let J2 
be Levy and be independent of N. Then 

Plim^(A,X) 2 / {(AiX)2 < rW} = f a 2 t dt. (13) 
i=i Jo 

Jacod (2006) proves the consistency of the threshold estimator when the jump process is 
a more general pure jump semimartingale, with the choice r(h) = hr . The proof we present 
here is simpler and it allows to understand the contribution of the different jump terms to the 
estimation bias. Most importantly, the advantage of the approach presented here is that it 
allows to prove a central limit theorem for IV without any substantial assumption on a, while 
in Jacod (2006) an assumption on the dynamics of a is needed in order to get a CLT. This topic 
is further developed in Cont and Mancini (2005). 

To prove theorem 14.11 we decompose X into the sum of a jump diffusion process, X±, with 
stochastic diffusion coefficient and a finite activity jump part, plus an infinite activity compen- 
sated process J2 of small jumps. We use corollary 13.21 for the first term, and we show that the 
contribution of each AjJ2 is negligible within the truncated version ^2™ = i(\X) 2 1{(AiX) 2 <r(h)} 
of [XW] T - 



Proof. Since X = X\ + J 2 , we can write 
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{(A i X)2<r(ft)} 



a 2 dt 



< 



Sj( A i^l) 2 -^{(A 4 X 1 ) 2 <4r(/i)} - f a2 dt 



+ 



(14) 



+2 



{(A l X) 2 <r(/ i )} 



+ 



X^(AjJ 2 ) 2 I{( Ai x)2< r (ft)} 



By corollary 13.21 we know that the first term of the left hand side tends to zero in probability. 
We now show that the Plim of each one of the other three terms of the left hand side is zero. 

Let us deal with the second term: 



(A l X) 2 <r(ft)} - ^{(A.Xi) 2 ^^)} 



E( AiXl ) 2 ( / {(A l X) 2 <r(/ l ),(A l X 1 ) 2 >4r(h)} ~ / {(A l X) 2 >r(/ l ),(A l X 1 ) 2 <4r(h)}) 



If ^{(A i A') 2 <r(fe),(A i Xi) 2 >4r(/i)} — 1) since 



(15) 



2^rjhj - \AiJ 2 \ < - \Aj 2 \ < \AtXi + Aj 2 \ < ^rjh), 



then |Aj J2I > y f(h). Thus a.s. 



E(AjXi) 2 /{( A . X ) 2 < r ( ?i ) j ( A . Xl )2 > 4 r ( fe )} < E(AjXi) 2 / 



{(AiJ 2 ) 2 >r(h)} 



< 



2E(A,Xo) 2 /. 



{(AiJ a )»>r(h)} + 2 E E^'l ^(AiJa) 2 ^^)}- 
i \i=l 



The first term is a.s. dominated by 



2A2/lln ^E / {(A^ 2 ) 2 >rW} 4 °' 



(16) 



(17) 



as h — > 0, since 



1 



1 £[(A,J 2 ) 2 ] 



Mnf nP{(AiJ 2 ) 2 > r(/i)} < hln^- n- 

h h r(n) 



nha 2 (l)- 



r(h) 
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Moreover 



P < 



BE 



^OV < P (Ui{AiN + 0, (A* J 2 ) 2 > r(/i)}) < (18) 



For the second term of (|15|) we note that by theorem 13. II for small h on {(AjXi) 2 < 4r(/i)} we 
have, uniformly with respect to i, A^N = 0. Therefore for small h 

{(A t X) 2 > r(h), (A t Xi) 2 < Mh)} C {(A,X + A^f > r(h)} C 
(W>^luW a ) a > r(h) 



4 J [ v * " 4 

However, by theorem 13.11 for small h a.s. Vi = l..n I... v km-, = and thus 
{(A;X ) 2 >-^} 

^(A i X 1 ) 2 / {(AiX) 2 >rWi(AiXl) 2< 4r(h)} < X/( A ^°) 2j {(A j J 2 ) 2 >^} ' 

i i 

as before in (|17[) . Therefore Q15|) vanishes. 

Let us now deal with (half) the Plim of the third term on the left hand side of (|14|). which 
coincides with 

Plim E A ^ lA ^ 27 {|A i X|<V^),|A i J 2 |<2VKft)} ' (19) 
i 

In fact if \AiX\ < y/r(h) and |AjJ 2 | > 2y/r(h) then 2 v / r(/i) - |A<Xi| < |AiJ 2 | - |AjXi| < 
|AjX| < y/r(h), i.e. |AjXi| > y/r(h), so that |AjJi| + |AjX | > \AiJi + AjX | > \/r(h), and 
then 



either |AA| > or |A^ | > ^ 



Since for small h, uniformly in i, I, 

' \A lXo \>^ h) 



2 2 
: 0, then 



(20) 



pj^lA^A^I^^ 



x|<VnW, |A i J 2 |>2,A : (h)} 



7^0 < 



(21) 



P Ui{|AiJ 2 | > 2y/r{K), AiN ± 0} , 



which tends to zero as in (|18|) . 



In order to deal now with ()19l) . note that if |AjX| < y/r(h) and |AjJ 2 | < 2yV(/i) then 



| AiJil - |A;X + A,J 2 | < |AiX| < vMM, 
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so a.s. AjiV < |AjJi| < \/r(h) + Ay/iln^ + (A^l = O y\/r(h)j , uniformly in i. Therefore 
a.s. for small h, V i on {|AjX| < yVf/ij, |A, J 2 | < 2^r(h)} we have AjiV = 0. Thus CHJ) is 
dominated by 

Phm^|A J X A l J 2 |/ {|AiJ2| < 2 ^ < 



Phm ^E( A ^o)yE( A ^^ 2J { |A,J 2 |<2V^)} = ^ Plim^( yW ) 2 = 

by the Schwartz inequality and remark T4.2I below. 

Finally let us show that last term of the left hand side of Q14|) tends to zero in probability. 
Analogously as in (|21|) 



p|E(a,j 2 ) 2 / {|Ai 



*I<VhX>, |A i J 2 |>2 x A : (h)} ^ f - 



P(U;{A;iV ^ 0, \Aj 2 \ > 2^rjh)}) -» 0, 
so that last term of ()14j) coincides with 

PlimE(A t J2) 2 / {|AiX| <^y 5 \^j 2 \<2^Hh)} ^ Plim E( A * ^) 2 % Ai J 2 )*<4r(h)} = 
i i 

by remark T4.2I □ 



Remark 4.2. A.s., /or small h, uniformly in i, on {(Aj^) 2 < 4r(/t)} we /iaue i/iai all the 
jumps |AJ2, S | are bounded by 2yf r(K), that is 

(AJ 2iS ) 2 < 4r(/i), Va€]ti_i,ti]. 

More precisely on {(Aj^) 2 < 4r(/i)} w;e have J* * / 2 ^(h)<|x|<i x l i iA s ,dx) = 0; therefore 
Aj J2-^{(a J 2 )2<4 r (/ l )} are ^ e increments of the process Y^ h > given by 

JO J|a;|<2- v /r(/i) J2 A /r(fc)<|x|<l 

^4s a consequence 

PlimE(A,J2) 2 / {(Aj2)2 < 4r(h)} = PlimE(A^ W ) 2 = 

i i 

Plim [r {/l) ] T = Phm ft x 2 fi(ds,dx) = 0, (22) 

JO J\x\<lA2y/r(h) 

since last term has expectation o~ 2 (l A 2\J r(h) ) — > 0, as h — > 0. 
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In fact 



r{h) r(h) 

as h — ► 0, meaning that a.s., uniformly on i, X/selt* 1 ti] (A*-^) 2 tends to zero more quickly than 
r(h), that is for small h, for all i = l..n, 5Zse]tj-i tj](A^2,s) 2 < f{h), and thus (AJ2 jS ) 2 < Ar(h) 
for each s 6]tj_i, ti]. □ 

Remarks. 

i) Everything is still valid if we have non equally spaced observations. In fact if we set, as in 
the remark of the previous section, h = maxj Aij, the term E[Ir,^, j 2 )^y r (h)}^ we °^ en encounter 
from equation (|17|) on, is still negligible, since as h = maxAtj — » 0, P{(AjJ2) 2 > r(h)} < 
E[{ ^if ] = ^y 11 < c • On {(AiJ 2 ) 2 < Ar{h)} each jump size (AJ 2 ,,) 2 < 4r(At 4 ) < 4r(h), 
so that d22j) still holds. 

ii) Consistently with the results in Barnodrff-Nielsen et al. (2006) for the multipower varia- 
tions and in Jacod (2006), the asymptotic normality of our estimator of Jq a^ds does not hold 
in general if X has an infinite activity Levy jump component and general cadlag coefficient a 
(Cont and Mancini 2005). Namely the asymptotic normality holds when the jump component 
has a moderate jump activity (when the Blumenthal-Gatoor index a of J belongs to [0, 1[), 
while the speed of convergence of IVt is less than \fh if the activity of jump of J is too wild 
(a€ [1,2[). 



5 Simulations 

In this section we study the performance of our threshold estimator on finite samples. We im- 
plement the threshold estimator within three different simulated models which are commonly 
used in finance: a jump diffusion process with jump part given by a compound Poisson process 
with Gaussian jump sizes; a similar model with stochastic diffusion coefficient correlated with 
the Brownian motion driving the dynamics of X; and a model with an infinite activity (finite 
variation) Variance Gamma jump part. 



MODEL 1. Let us begin with the case of a jump diffusion process with finite activity 
compound Poisson jump part. We generated TV" = 5000 trajectories of a process of kind 

N T 

dX t = adWt + Y^Zi 
i=l 

with Zi i.i.d. with law JV(0, r/ 2 ), where rj = 0.6, a = 0.3 and A is intentionally chosen higher than 
a realistic situation, A = 5, like as in Ait-Sahalia (2004). To generate each path we discretized 
EDS and we took n = 6000 equally spaced observations Xt i with lag h = — so that T = 1. We 
chose r(h) = h ' 9 . Figure 1 shows the distribution of the 5000 values assumed by the normalized 
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bias term 

E"=l( A ^) 2/ {(A a X)2<r(/ Q} ~ Jo rfdt 



I Ei( A ^) 4/ {(A l X) 2 <r(h)} 

versus the standard Gaussian density (continuous line). 



Figure 1: Distribution of 5000 values assumed by the normalized bias term (24), when X has 
constant volatility and compound Poisson jumps (MODEL 1). The continuous line is the density 
of the theoretical limit law jV(0, 1). 



MODEL 2. Let us now consider a process with jump part given by a finite activity compound 
Poisson process and a stochastic diffusion coefficient correlated with the Brownian motion driving 
X. We generated N = 5000 trajectories of a process of kind 



x t = HSt 



where 



dS Nt 
— ^ = fidt + a t dW t {1) + dJ t J t = Y j Z u Zi~M{m G ,v^ 



Note that 



a t = e Ht , dH t = -k(H t - H)dt + r)dW} 2) , d <W {1) ,W {2) > t = pdt. 



dX t = (p- a 2 /2)dt + adW t {1) + m(l + AJ t 



We chose \i = 0, A = 4 and a negative correlation coefficient p = —0.7; then we took Hq = ln(0.3), 
k = 1, H = ln(0.25), rj = 0.01 so to ensure that a path of a within [0, T] varies most between 
0.2 and 0.4. Moreover = 0.001, v = y/0.02 give relative amplitudes of the jumps of S most 
between 0.01 and 0.20. Finally we again took n = 6000 equally spaced observations X^ with 
lag h = — and r(h) = /i 9 . Figure 2 shows the distribution of the normalized bias term l)23|) 
against the asymptotic density (continuous line). 

MODEL 3. Figure 3 shows the distribution obtained in the case of a Variance Gamma 
(VG) jump component. The VG process is a pure jump process with infinite activity and finite 
variation. We add to it a diffusion component aBt'. 



X t = aB t + cG t + rjW Gt . 
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Figure 2: Distribution of 5000 values assumed by the normalized bias term (24) when X has 
stochastic volatility, negatively correlated with W^ 1 ), plus compound Poisson jumps (MODEL 
2). The continuous line is the density of the theoretical limit law Af(0, 1). 



The subordinator G is a Gamma process having Var{G\) = b, B and W are independent 
Brownian motions; we chose N = 5000, n = 6000 and h = 1/n. b = 0.23, c = —0.2 and rj = 0.2 
are chosen as in Madan (2001); a = 0.3 is chosen so that Var(X ± ) = rj 2 + c 2 b + a 2 = .0892 
matches the Var{X\) we obtained for model 1. Finally r(h) = h°". 




Figure 3: Distribution of 5000 values assumed by the normalized bias term (24) when X has 
constant volatility plus a Variance Gamma jump part (MODEL 3). The continuous line is the 
density of the theoretical limit law A/"(0, 1). 



6 Conclusions 

In this paper we devise a technique for identifying the time instants of significant jumps for a 
process driven by diffusion and jumps, based on a discrete record of observations, making use 
of a suitably defined threshold. 

We provide a consistent estimate of IV = Jq a 2 dt, extending previous results (Mancini 2001, 
2004) with very mild assumptions on a and a and, most importantly, allowing for infinite jump 
activity. 

When J has finite activity, we give a nonparametric estimate of the jump times and sizes, while 
when J has a pure jump Levy component with infinite activity we can identify the instants when 
jumps are larger than the threshold. When J has FA we also prove central limit results for IV 
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and for the jump sizes estimates. 

Compared with power variations, multipower variations or kernel estimators the threshold 
method in the FA case is a more effective way to identify each interval ]tj-i, tj] where J jumped. 

We also prove that the threshold estimator of IV is efficient. 
Moreover, our method allows the extension of kernel estimators in diffusion frameworks to pro- 
cesses driven by diffusions and jumps, provided we eliminate the jumps (Mancini and Reno, 
2006). 

The consistency of the threshold estimator holds even under leverage, both in FA and IA cases. 
The threshold technique holds even when the observations are not equally spaced and also when 
the threshold is time varying, which is particularly important for the practical application of 
the estimator. 

The advantage of the approach presented here is that it allows to prove a central limit 
theorem for IV without any substantial assumptions on a, while in Jacod (2006) an assumption 
on the dynamics of a is needed in order to get a CLT. This topic is further developed in Cont 
and Mancini (2005). 

The good performance of our estimator on finite samples of realistic length is shown within 
three different simulated models. 
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